AITopics | count letter

Can ChatGPT Learn to Count Letters?

Conde, Javier, Martínez, Gonzalo, Reviriego, Pedro, Gao, Zhen, Liu, Shanshan, Lombardi, Fabrizio

arXiv.org Artificial Intelligence, Feb-23-2025

In this paper we explore whether ChatGPT can learn to count letters. Since the introduction of ChatGPT two years ago, Large Language Model (LLM) based tools have shown impressive capabilities in solving mathematical problems and answering questions on almost any topic [1]. In fact, evaluation benchmarks have to be revised frequently to make them harder as LLM performance improves continuously [2]. The development of LLMs has also been hectic, with new models presented by large companies such as Google with Gemini or Gemma, Meta with Llama, or xAI with Grok. OpenAI has also released newer versions and improvements of its Generative Pre-trained Transformer (GPT) family, such as GPT4 [3] and its variants GPT4o and GPT4o1. Those foundational models are then adapted to answer questions or interact with users and complemented with other functionalities to implement conversational tools like ChatGPT. Despite these astonishing results, there are some simple tasks that LLMs struggle with, such as arithmetic operations [4] or even counting the occurrences of a given letter in a word. For example, many LLMs failed to count the number of "r" in strawberry
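For reference, the ground truth for this task is trivial to compute programmatically, which is what makes the LLM failures notable. A minimal Python sketch follows; the function name `count_letter` and the case-insensitive matching are illustrative assumptions, not something taken from the paper.

```python
def count_letter(word: str, letter: str) -> int:
    """Return how many times `letter` occurs in `word`, ignoring case.

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    return word.lower().count(letter.lower())


# The example from the abstract: occurrences of "r" in "strawberry".
print(count_letter("strawberry", "r"))  # 3
```

A value like this could serve as the reference answer when checking a model's reply.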

count letter, llm, madrid

doi: 10.1109/MC.2024.3488313

arXiv: 2502.16705

Country:

Europe > Spain > Community of Madrid > Madrid (0.09)
Asia > China > Tianjin Province > Tianjin (0.06)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
Asia > China > Sichuan Province > Chengdu (0.05)

Genre: Research Report (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Why Do Large Language Models (LLMs) Struggle to Count Letters?

Fu, Tairan, Ferrando, Raquel, Conde, Javier, Arriaga, Carlos, Reviriego, Pedro

arXiv.org Artificial Intelligence, Dec-19-2024

Large Language Models (LLMs) have achieved unprecedented performance on many complex tasks, being able, for example, to answer questions on almost any topic. However, they struggle with some simple tasks, such as counting the occurrences of letters in a word, as illustrated by the inability of many LLMs to count the number of "r" letters in "strawberry". Several works have studied this problem and linked it to the tokenization used by LLMs, to the intrinsic limitations of the attention mechanism, or to the lack of character-level training data. In this paper, we conduct an experimental study to evaluate how LLM errors when counting letters relate to 1) the frequency of the word and its components in the training dataset and 2) the complexity of the counting operation. We present a comprehensive analysis of the errors of LLMs when counting letter occurrences by evaluating a representative group of models over a large number of words. The results show a number of consistent trends in the models evaluated: 1) models are capable of recognizing the letters but not counting them; 2) the frequency of the word and of the tokens in the word does not have a significant impact on the LLM errors; 3) there is a positive correlation between letter frequency and errors, with more frequent letters tending to have more counting errors; 4) the errors show a strong correlation with the number of letters or tokens in a word; and 5) the strongest correlation occurs with the number of letters with counts larger than one, with most models being unable to correctly count the letters in words in which letters appear more than twice.
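To make the word-level quantities in findings 4) and 5) concrete, the sketch below computes them for a single word: the number of letters, and the number of distinct letters whose count is larger than one. The function name `counting_features` and the dictionary keys are assumptions made for illustration; the abstract does not describe the authors' actual tooling, and token counts are omitted because they depend on each model's tokenizer.

```python
from collections import Counter


def counting_features(word: str) -> dict:
    """Per-word features related to counting difficulty (illustrative only)."""
    # Count alphabetic characters only, ignoring case.
    counts = Counter(ch for ch in word.lower() if ch.isalpha())
    return {
        # Total number of letters in the word.
        "num_letters": sum(counts.values()),
        # Number of different letters that appear.
        "num_distinct_letters": len(counts),
        # Number of distinct letters with a count larger than one.
        "num_repeated_letters": sum(1 for c in counts.values() if c > 1),
        # Highest count of any single letter (0 for an empty word).
        "max_multiplicity": max(counts.values(), default=0),
    }


print(counting_features("strawberry"))
# {'num_letters': 10, 'num_distinct_letters': 8, 'num_repeated_letters': 1, 'max_multiplicity': 3}
```

Features like these could then be compared against per-word model error rates to look for the correlations the abstract reports.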

large language model, machine learning, natural language

arXiv: 2412.18626

Country:

North America > United States (0.28)
Asia > China > Jiangsu Province (0.14)

Genre: Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)
